Abstract



A search for supersymmetry or other new physics resulting in similar final states is
presented using a data sample of 4.73 fb$^{-1}$ of pp collisions collected at $\sqrt{s} = 7$ TeV
with the CMS detector at the LHC. Fully hadronic final states are selected based on
the variable $M_{T2}$, an extension of the transverse mass in events with two invisible
particles. Two complementary studies are performed. The first targets the region of
parameter space with medium to high squark and gluino masses, in which the signal
can be separated from the standard model backgrounds by a tight requirement on
$M_{T2}$. The second is optimized to be sensitive to events with a light gluino and heavy
squarks. In this case, the $M_{T2}$ requirement is relaxed, but a higher jet multiplicity and
at least one b-tagged jet are required. No significant excess of events over the standard
model expectations is observed. Exclusion limits are derived for the parameter space
of the constrained minimal supersymmetric extension of the standard model, as well
as for a variety of simplified model spectra.
1 Introduction

A broad class of extensions of the standard model (SM) predict the existence of heavy colored
particles that decay to hadronic final states accompanied by large missing transverse energy
($E_T^{miss}$). The best known of these scenarios is supersymmetry [1] (SUSY) with R-parity
conservation. In this paper we present a search for such new physics in pp collisions collected with
the Compact Muon Solenoid (CMS) detector at the Large Hadron Collider (LHC) at a
center-of-mass energy of 7 TeV. The results are based on the data sample collected in 2011,
corresponding to about 4.73 fb$^{-1}$ of integrated luminosity.

The search makes use of the "stransverse mass" variable $M_{T2}$ [2, 3] to select new physics
candidate events. $M_{T2}$ is the natural extension of the transverse mass $M_T$ to the case where two
colored supersymmetric particles ("sparticles") are pair-produced and both decay through a
cascade of jets and possibly leptons to the lightest supersymmetric particle (LSP). The LSP is
not visible in the detector and leads to a missing transverse momentum signature. Although
$M_{T2}$ was originally introduced to derive the masses of sparticles involved in the cascade decay,
we use it here as a discovery variable since it is sensitive to the presence of SUSY-like new
physics. The distribution of $M_{T2}$ reflects the produced particle masses, which are much lighter
for the SM background processes than for the SUSY processes. Hence, new physics is expected
to appear as an excess in the tail of $M_{T2}$.

The analysis is based on two complementary approaches. A first approach, the "$M_{T2}$ analysis",
targets events resulting from heavy sparticle production, characterized by large $E_T^{miss}$, at
least three jets, and large $M_{T2}$. The SM backgrounds in the signal region consist of
W($\ell\nu$)+jets, Z($\nu\bar{\nu}$)+jets, $t\bar{t}$, and single-top events (the last two will be
referred to collectively as top-quark background), which are estimated from data control regions
and simulation. This analysis loses sensitivity if the squarks are heavy and the gluinos light, in
which case the production is dominated by gluino-gluino processes. The gluinos give rise to
three-body decays with relatively small $E_T^{miss}$. Since the gluino decay is mediated by virtual
squark exchange and the stop and sbottom are expected to be lighter than the first- and
second-generation squarks, these events can be rich in b quarks. To increase the sensitivity to such
processes, a second approach, the "$M_{T2}b$ analysis", is developed, in which the threshold on
$M_{T2}$ defining the signal region is lowered. To suppress the QCD multijet background, we demand at
least one b-tagged jet and place a stricter requirement on the jet multiplicity. The $M_{T2}b$
analysis provides a larger signal-to-background ratio in the region of heavy squarks and light
gluinos and hence improves our sensitivity to this scenario.

This paper extends previous results of searches in fully hadronic final states from the CMS [4-7]
and ATLAS [8-11] Collaborations. It is organized as follows: after a brief introduction to
$M_{T2}$ and its salient properties in Section 2 and a description of the CMS detector in Section 3,
we present in Section 4 the data samples used and the event selection. In Section 5 the search
strategy is presented. This strategy is applied to the $M_{T2}$ analysis in Section 6 and to the
$M_{T2}b$ analysis in Section 7. In these sections the background estimation methods are also
discussed. We interpret the results in Section 8 in the context of the constrained minimal
supersymmetric standard model (CMSSM) as well as for a variety of simplified models. Finally,
Section 9 contains a summary.

2 Definition of $M_{T2}$

The variable $M_{T2}$ was introduced [2] to measure the mass of primary pair-produced particles
in a situation where both ultimately decay into undetected particles (e.g., LSPs), leaving the
event kinematics underconstrained. It assumes that the two produced sparticles give rise to
identical types of decay chains with two visible systems defined by their transverse momenta
$\vec{p}_T^{\,vis(i)}$, transverse energies $E_T^{vis(i)}$, and masses $m^{vis(i)}$. They are
accompanied by the unknown LSP transverse momenta $\vec{p}_T^{\,\tilde{\chi}(i)}$. In analogy with
the transverse mass used for the W boson mass determination [12], we can define two transverse
masses (i = 1, 2):

$$ \left(M_T^{(i)}\right)^2 = \left(m^{vis(i)}\right)^2 + m_{\tilde{\chi}}^2 + 2\left(E_T^{vis(i)} E_T^{\tilde{\chi}(i)} - \vec{p}_T^{\,vis(i)} \cdot \vec{p}_T^{\,\tilde{\chi}(i)}\right). \tag{1} $$

These have the property (as in W-boson decays) that, for the true LSP mass $m_{\tilde{\chi}}$, their
distributions cannot exceed the mass of the parent particle of the decay and they present an
endpoint at the value of the parent mass. The momenta $\vec{p}_T^{\,\tilde{\chi}(i)}$ of the
invisible particles are not experimentally accessible individually. Only their sum, the missing
transverse momentum $\vec{p}_T^{\,miss}$, is known. Therefore, in the context of SUSY, a
generalization of the transverse mass is needed and the proposed variable is $M_{T2}$. It is
defined as

$$ M_{T2}(m_{\tilde{\chi}}) = \min_{\vec{p}_T^{\,\tilde{\chi}(1)} + \vec{p}_T^{\,\tilde{\chi}(2)} = \vec{p}_T^{\,miss}} \left[ \max\left( M_T^{(1)}, M_T^{(2)} \right) \right], \tag{2} $$

where the LSP mass $m_{\tilde{\chi}}$ remains a free parameter. This formula can be understood as
follows. As neither $M_T^{(1)}$ nor $M_T^{(2)}$ can exceed the parent mass if the true momenta are
used, the larger of the two can be chosen. To make sure that $M_{T2}$ does not exceed the parent
mass, a minimization is performed on trial LSP momenta fulfilling the $\vec{p}_T^{\,miss}$
constraint. The distribution of $M_{T2}$ for the correct value of $m_{\tilde{\chi}}$ then has an
endpoint at the value of the primary particle mass. If, however, $m_{\tilde{\chi}}$ is lower
(higher) than the correct mass value, the endpoint will be below (above) the parent mass. An
analytic expression for $M_{T2}$ has been computed [13] assuming that initial-state radiation (ISR)
can be neglected. In practice, the determination of $M_{T2}$ may be complicated by the presence of
ISR or, equivalently, transverse momentum arising from decays that occur upstream in the decay
chain [14]. In this case, no analytic expression for $M_{T2}$ is known, but it can be computed
numerically, using, e.g., the results of Ref. [15].
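The minimization over trial LSP momenta in Eq. (2) can be illustrated with a brute-force scan. The following is a minimal sketch, not the algorithm of Ref. [15]: it assumes massless visible systems, and the function name `mt2_grid_scan`, the scan range, and the grid resolution are arbitrary choices made here for illustration. The result is only as precise as the grid spacing.

```python
import numpy as np


def mt2_grid_scan(p1, p2, ptmiss, m_chi=0.0, half_width=300.0, n=301):
    """Approximate M_T2 by scanning trial LSP momenta on a regular grid.

    p1, p2  : (px, py) of the two visible systems, assumed massless (GeV)
    ptmiss  : (px, py) of the missing transverse momentum (GeV)
    m_chi   : trial LSP mass (GeV)
    The scan range and resolution are illustrative assumptions.
    """
    p1, p2, ptmiss = (np.asarray(v, dtype=float) for v in (p1, p2, ptmiss))
    grid = np.linspace(-half_width, half_width, n)
    qx, qy = np.meshgrid(grid, grid)           # trial momentum of the first LSP
    rx, ry = ptmiss[0] - qx, ptmiss[1] - qy    # second LSP fixed by the constraint

    def mt_squared(p, kx, ky):
        et_vis = np.hypot(p[0], p[1])          # E_T of a massless visible system
        et_chi = np.sqrt(m_chi**2 + kx**2 + ky**2)
        return m_chi**2 + 2.0 * (et_vis * et_chi - (p[0] * kx + p[1] * ky))

    # For each trial splitting keep the larger transverse mass, then minimize.
    larger = np.maximum(mt_squared(p1, qx, qy), mt_squared(p2, rx, ry))
    return float(np.sqrt(max(larger.min(), 0.0)))
```

For two perpendicular 100 GeV systems recoiling against the missing momentum, `mt2_grid_scan((100, 0), (0, 100), (-100, -100))` should come out close to 141 GeV, and a back-to-back configuration with no missing momentum should give a value compatible with zero.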

To illustrate the behavior of $M_{T2}$, we consider the simple example of $M_{T2}$ without ISR or
upstream transverse momentum. As discussed in Ref. [13], the angular and $p_T$ dependence of
$M_{T2}$ is encoded in a variable $A_T$:

$$ A_T = E_T^{vis(1)} E_T^{vis(2)} + \vec{p}_T^{\,vis(1)} \cdot \vec{p}_T^{\,vis(2)}, \tag{3} $$

and $M_{T2}$ increases as $A_T$ increases. Therefore, the minimum value of $M_{T2}$ is reached in
configurations where the visible systems are back-to-back. The maximum value is reached when they
are parallel to each other and have large $p_T$. In the simple case where $m_{\tilde{\chi}} = 0$
and the visible systems have zero mass, $M_{T2}$ becomes

$$ (M_{T2})^2 = 2 A_T = 2\, p_T^{vis(1)} p_T^{vis(2)} \left(1 + \cos\phi_{12}\right), \tag{4} $$

where $\phi_{12}$ is the angle between the two visible systems in the transverse plane. It can be
seen that Eq. (4) corresponds to the transverse mass of two systems,
$(M_T)^2 = 2\, p_T^{sys(1)} p_T^{sys(2)} (1 - \cos\phi_{12})$, with
$\vec{p}_T^{\,vis} = -\vec{p}_T^{\,sys}$ for one of the systems.
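As a small worked example of Eq. (4), the sketch below evaluates $M_{T2}$ for massless visible systems with a massless LSP and no ISR or upstream momentum. The function name is hypothetical; the formula is valid only under the assumptions stated for Eq. (4).

```python
import math


def mt2_massless_no_isr(pt1, phi1, pt2, phi2):
    """M_T2 from Eq. (4): massless visible systems, m_chi = 0, no ISR.

    pt1, pt2   : transverse momenta of the two visible systems (GeV)
    phi1, phi2 : their azimuthal angles (rad)
    """
    # A_T = E_T1 * E_T2 + pT1 . pT2, which reduces to pt1 * pt2 * (1 + cos(phi12))
    a_t = pt1 * pt2 * (1.0 + math.cos(phi1 - phi2))
    return math.sqrt(max(2.0 * a_t, 0.0))
```

Back-to-back systems give `mt2_massless_no_isr(100, 0, 100, math.pi) == 0`, while parallel systems give the maximum, $2\sqrt{p_T^{vis(1)} p_T^{vis(2)}}$, i.e. 200 GeV for two 100 GeV systems.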

In this paper, we use $M_{T2}$ as a variable to distinguish potential new physics events from SM
backgrounds. The use of $M_{T2}$ as a discovery variable was first proposed in Ref. [16], but here
we follow a different approach. Several choices for the visible system used as input to $M_{T2}$
can be considered: dijet events (as in Ref. [16]), the two jets with largest $p_T$ in multijet
events, or two systems of pseudojets defined by grouping jets together. In this study we use the
last method.

A technique to group jets in multijet events into two pseudojets is the "event hemispheres"
method described in Ref. [17] (see Section 13.4). We take the two initial axes as the directions of
the two massless jets that yield the largest dijet invariant mass. The pseudojets are then formed
based on a minimization of the Lund distance criterion [17, 18].
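The grouping can be sketched as follows. This is an illustrative simplification, not the implementation of Ref. [17]: the explicit form of the Lund distance used here, $(E_i - p_i\cos\theta_{ik})\,E_i/(E_i + E_k)^2$, is quoted from memory and should be treated as an assumption, as should the iteration scheme and all function names.

```python
import math


def _sum4(vectors):
    """Component-wise sum of a non-empty list of (E, px, py, pz) tuples."""
    return tuple(sum(c) for c in zip(*vectors))


def _pmag(v):
    return math.sqrt(v[1] ** 2 + v[2] ** 2 + v[3] ** 2)


def _lund(axis, jet):
    """Assumed Lund distance of `jet` to hemisphere `axis` (both E, px, py, pz)."""
    pa, pj = _pmag(axis), _pmag(jet)
    cos_theta = (axis[1] * jet[1] + axis[2] * jet[2] + axis[3] * jet[3]) / (pa * pj)
    return (axis[0] - pa * cos_theta) * axis[0] / (axis[0] + jet[0]) ** 2


def hemispheres(jets, max_iter=20):
    """Assign each massless jet (E, px, py, pz) to hemisphere 0 or 1."""
    # Seeds: the pair of jets with the largest invariant mass.
    best, seeds = -1.0, (0, 1)
    for i in range(len(jets)):
        for j in range(i + 1, len(jets)):
            s = _sum4([jets[i], jets[j]])
            m2 = s[0] ** 2 - _pmag(s) ** 2
            if m2 > best:
                best, seeds = m2, (i, j)
    axes = [jets[seeds[0]], jets[seeds[1]]]

    def assign(current_axes):
        return [0 if _lund(current_axes[0], j) <= _lund(current_axes[1], j) else 1
                for j in jets]

    labels = assign(axes)
    for _ in range(max_iter):
        # Recompute each axis as the four-vector sum of its jets and reassign.
        axes = [_sum4([j for j, l in zip(jets, labels) if l == h]) for h in (0, 1)]
        new_labels = assign(axes)
        if new_labels == labels or len(set(new_labels)) < 2:
            break
        labels = new_labels
    return labels
```

The two pseudojets are then the four-vector sums of the jets carrying label 0 and label 1, respectively.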

We use $M_{T2}$ as our main search variable since SUSY events with large expected $E_T^{miss}$ and
jet acoplanarity will be concentrated in the large $M_{T2}$ region. In contrast, QCD dijet events,
in which the two jets are back-to-back, populate the region of small $M_{T2}$ regardless of the
value of $E_T^{miss}$ or jet $p_T$. In the present study, we choose the visible systems to be
massless and set $m_{\tilde{\chi}} = 0$. Then back-to-back dijet events will have $M_{T2} = 0$, as
explained above. Hence, $M_{T2}$ has a built-in protection against jet mismeasurements in QCD dijet
events, even if accompanied by large $E_T^{miss}$. However, QCD multijet events with large
$E_T^{miss}$ may give rise to acoplanar pseudojets, leading to larger $M_{T2}$ values. For this
reason, further protections against $E_T^{miss}$ from mismeasurements need to be introduced, as
described below. Other SM backgrounds, such as $t\bar{t}$, single top-quark, and W+jets events with
leptonic decays, or Z+jets events where the Z boson decays to neutrinos, contain true $E_T^{miss}$
and can also lead to acoplanar pseudojets.

3 CMS detector 

The central feature of the CMS apparatus is a superconducting solenoid 13 m in length and
6 m in diameter that provides an axial magnetic field of 3.8 T. The core of the solenoid is
instrumented with various particle detection systems: a silicon pixel and strip tracker, an
electromagnetic calorimeter (ECAL), and a brass/scintillator hadron calorimeter (HCAL). The silicon
pixel and strip tracker covers $|\eta| < 2.5$, where pseudorapidity $\eta$ is defined by
$\eta = -\ln[\tan(\theta/2)]$ with $\theta$ the polar angle of the trajectory of the particle with
respect to the counterclockwise beam direction. The ECAL and HCAL cover $|\eta| < 3$. The steel
return yoke outside the solenoid is instrumented with gas detectors used to identify muons. A
quartz-steel Čerenkov-radiation-based forward hadron calorimeter extends the coverage to
$|\eta| < 5$. The detector is nearly hermetic, covering $0 < \phi < 2\pi$ in azimuth, allowing for
energy balance measurements in the plane transverse to the beam directions. The first level of the
CMS trigger system, composed of custom hardware processors, uses information from the calorimeters
and muon detectors to select the most interesting events in a fixed time interval of less than
4 $\mu$s. The High Level Trigger processor farm further decreases the event rate from around
100 kHz to around 300 Hz, before data storage. A detailed description of the CMS detector can be
found elsewhere [19].

4 Samples and event selection 

The data used in this analysis were collected by triggers based on the quantity $H_T$, the scalar
sum of transverse momenta of reconstructed and energy-corrected calorimeter jets. Due to a
continuous increase in the instantaneous luminosity of the LHC, the trigger evolved with time
from the requirement $H_T > 440$ GeV to $H_T > 750$ GeV. In this analysis, only triggers with a
threshold of 650 GeV or less have been used, corresponding to a total integrated luminosity of
4.73 fb$^{-1}$.

The analysis is designed using simulated event samples created with the PYTHIA 6.4.22 [18] and
MadGraph 5v1.1 [20] Monte Carlo event generators. These events are subsequently processed
with a detailed simulation of the CMS detector response based on GEANT4 [21]. The events are
reconstructed and analyzed in the same way as the data. The SUSY signal particle spectrum is
calculated using SOFTSUSY [22], and for the decays SDECAY [23] is used. We use two CMS SUSY
benchmark signal samples, referred to as LM6 and LM9 [17], to illustrate possible CMSSM [24]
yields. The CMSSM is defined by the universal scalar and gaugino mass parameters $m_0$ and
$m_{1/2}$, respectively, the parameter $A_0$ of the trilinear couplings, the ratio of the vacuum
expectation values of the two Higgs fields $\tan\beta$, and the sign of the Higgs mixing parameter
sign($\mu$). The parameter values for LM6 are $m_0 = 85$ GeV, $m_{1/2} = 400$ GeV,
$\tan\beta = 10$, $A_0 = 0$ GeV, and sign($\mu$) > 0. Those for LM9 are $m_0 = 1450$ GeV,
$m_{1/2} = 175$ GeV, $\tan\beta = 50$, $A_0 = 0$ GeV, and sign($\mu$) > 0. All samples are
generated using the CTEQ6 [25] parton distribution functions (PDFs). For SM background simulated
samples we use the most accurate calculation of the cross sections currently available, usually
with next-to-leading-order (NLO) accuracy. For the CMS SUSY benchmark signal samples we use NLO
cross sections of 0.403 pb and 10.6 pb for LM6 and LM9, respectively, obtained by weighting the
leading-order cross sections from PYTHIA with sub-process dependent K-factors calculated with
PROSPINO [26].
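To give a feeling for the scale of these benchmark cross sections, the sketch below converts them into the number of signal events produced in 4.73 fb$^{-1}$, before any trigger or selection requirement. The function name and the optional `efficiency` argument are illustrative assumptions, not quantities quoted in the text.

```python
LUMI_PB = 4.73 * 1000.0  # 4.73 fb^-1 expressed in pb^-1


def expected_events(xsec_pb, lumi_pb=LUMI_PB, efficiency=1.0):
    """Expected event count N = sigma * L * efficiency (efficiency is hypothetical)."""
    return xsec_pb * lumi_pb * efficiency


n_lm6 = expected_events(0.403)  # about 1.9e3 produced LM6 events
n_lm9 = expected_events(10.6)   # about 5.0e4 produced LM9 events
```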

The events are reconstructed using the particle-flow (PF) algorithm [27], which identifies and
reconstructs individually the particles produced in the collision, namely charged hadrons,
photons, neutral hadrons, electrons, and muons.

Electrons and muons with $p_T > 10$ GeV and $|\eta| < 2.4$ are considered isolated if the transverse
momentum sum of charged hadrons, photons, and neutral hadrons surrounding the lepton
within a cone of radius $\sqrt{(\Delta\eta)^2 + (\Delta\phi)^2} = 0.4$, divided by the lepton
transverse momentum value itself, is less than 0.2. The electron and muon reconstruction and
identification algorithms are described in Refs. [28, 29] and [30], respectively. All particles
apart from the isolated electrons and muons are clustered into jets using the anti-$k_T$ jet
clustering algorithm [31] with distance parameter 0.5 [32, 33]. Jet energies are calibrated by
applying correction factors as a function of the transverse momentum and the pseudorapidity of the
jet. Residual jet energy corrections are applied to jets in data to account for differences in jet
energy scale between simulation and data [34]. The effect of pileup, namely multiple pp collisions
within a beam crossing, is reduced by using the FastJet pileup subtraction procedure [35, 36] for
data and simulated events. Jets are required to pass loose identification criteria and to satisfy
$p_T > 20$ GeV and $|\eta| < 2.4$. The b-jet tagging is based on the simple-secondary-vertex
algorithm [37]. We use the high-purity working point that yields a typical jet-tagging efficiency
of 42% for b jets in our search region, while the mistagging efficiency for light-flavored (uds
quark and gluon) jets is of the order of 0.1% and for c jets, 6.3%. The missing transverse momentum
$E_T^{miss}$ is computed as the negative vector sum of all particles reconstructed by the PF
algorithm [33].
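The isolation variable and the missing transverse momentum described above can be written compactly. The sketch below is illustrative only: the dictionary-based particle representation and the function names are assumptions, and the lepton itself is assumed to be excluded from the list passed to `relative_isolation`.

```python
import math


def relative_isolation(lepton, particles, cone=0.4):
    """Sum of particle pT within Delta R < cone of the lepton, divided by lepton pT.

    lepton and each particle are dicts with keys "pt", "eta", "phi"
    (a hypothetical representation chosen for this example).
    """
    total = 0.0
    for p in particles:
        dphi = math.remainder(p["phi"] - lepton["phi"], 2.0 * math.pi)
        if math.hypot(p["eta"] - lepton["eta"], dphi) < cone:
            total += p["pt"]
    return total / lepton["pt"]


def missing_pt(particles):
    """Negative vector sum of the transverse momenta of all particles."""
    px = -sum(p["pt"] * math.cos(p["phi"]) for p in particles)
    py = -sum(p["pt"] * math.sin(p["phi"]) for p in particles)
    return px, py
```

A lepton would be called isolated here if `relative_isolation(...) < 0.2`, following the threshold quoted in the text.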

Events are required to contain at least one good primary vertex [38]. The $H_T$ value, computed
from PF jets with $p_T > 50$ GeV, must satisfy $H_T > 750$ GeV. With this $H_T$ requirement, the
triggers are nearly 100% efficient. At least three jets are required, where a $p_T$ threshold of
40 GeV is used for jet counting. The two leading jets are required to have $p_T > 100$ GeV. The
value of $E_T^{miss}$ is required to exceed 30 GeV. Events containing beam background or anomalous
calorimeter noise are rejected. To reject events where a significant fraction of the momentum
imbalance arises from forward or soft jets, a maximum difference of 70 GeV is imposed on the
modulus of the difference between the $E_T^{miss}$ and $H_T^{miss}$ vectors, where $H_T^{miss}$ is
the negative vector sum of all selected jets. Events containing jet candidates with
$p_T > 50$ GeV that fail the jet identification criteria are also rejected.

To reduce the background from QCD multijet events with large $E_T^{miss}$, arising from
mismeasurements or leptonic heavy flavor decays, a minimum azimuthal difference
$\Delta\phi_{min}(\mathrm{jets}, E_T^{miss}) > 0.3$ is required between the directions of
$E_T^{miss}$ and any jet with $p_T > 20$ GeV. Finally, events are rejected if they contain an
isolated electron or muon, to suppress the contributions from W+jets, Z+jets, and top-quark
backgrounds.
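The kinematic part of this selection can be summarized in a few lines. The sketch below is a simplified illustration under stated assumptions: jets are given as `(pt, phi)` pairs that already satisfy $p_T > 20$ GeV, $|\eta| < 2.4$, and the jet identification; the vertex requirement, noise filters, and the veto on jets failing identification are not modelled; the function name is hypothetical.

```python
import math


def passes_selection(jets, met_px, met_py, n_isolated_leptons=0):
    """Apply the kinematic requirements of the M_T2 analysis preselection.

    jets : list of (pt, phi) for selected jets (pT > 20 GeV, |eta| < 2.4, ID passed)
    """
    if len(jets) < 3:
        return False
    pts = sorted((pt for pt, _ in jets), reverse=True)
    h_t = sum(pt for pt in pts if pt > 50.0)       # H_T from jets with pT > 50 GeV
    n_jets = sum(1 for pt in pts if pt > 40.0)     # jet counting threshold 40 GeV
    met = math.hypot(met_px, met_py)
    # H_T^miss: negative vector sum of all selected jets
    mht_px = -sum(pt * math.cos(phi) for pt, phi in jets)
    mht_py = -sum(pt * math.sin(phi) for pt, phi in jets)
    met_phi = math.atan2(met_py, met_px)
    dphi_min = min(abs(math.remainder(phi - met_phi, 2.0 * math.pi))
                   for _, phi in jets)
    return (h_t > 750.0
            and n_jets >= 3
            and pts[1] > 100.0
            and met > 30.0
            and math.hypot(met_px - mht_px, met_py - mht_py) < 70.0
            and dphi_min > 0.3
            and n_isolated_leptons == 0)
```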

5 Search strategy 

The $M_{T2}$ variable is computed after applying the selection criteria of Section 4. We separately
consider fully hadronic channels with $\geq 3$ jets and a tight $M_{T2}$ requirement (the $M_{T2}$
analysis), which is mostly sensitive to signal regions with large squark and gluino masses, and
channels with $\geq 4$ jets, at least one tagged b jet, and a relaxed $M_{T2}$ requirement (the
$M_{T2}b$ analysis), which increases sensitivity to regions with small gluino and large squark
masses.

Given the event selection outlined above, we do not expect a significant number of QCD multijet
events to appear in the signal regions. Nonetheless, we estimate an upper limit on the
remaining QCD multijet background in the signal regions from data control samples. The
main backgrounds, consisting of W+jets, Z+jets, and top-quark production, are evaluated from
data control samples and simulation. A common strategy is applied to both the $M_{T2}$ and
$M_{T2}b$ analyses:

• Two regions are defined in $H_T$, a low $H_T$ region $750 < H_T < 950$ GeV and a high $H_T$
region $H_T > 950$ GeV. In each region, several adjacent bins in $M_{T2}$ are defined: five
bins for the $M_{T2}$ analysis and four for the $M_{T2}b$ analysis. The lowest bin in $M_{T2}$ is
chosen such that the expected QCD multijet background remains a small fraction of
the total background. For the $M_{T2}$ analysis the lowest bin starts at $M_{T2} = 150$ GeV
and for $M_{T2}b$ at $M_{T2} = 125$ GeV.

• A dedicated method for each background is designed to estimate its contribution in
the signal region from data control samples and simulation. The number of events
and their relative systematic uncertainties are computed by means of these methods
in each ($H_T$, $M_{T2}$) bin. The methods are designed such that the resulting estimates are
largely uncorrelated statistically.

• The predicted number of events for all background components and their
uncertainties are combined, resulting in an estimate of the total background yield and its
uncertainty in each bin.

• The estimated number of background events for each bin is compared to the number
of observed events, and the potential contribution from a SUSY signal is quantified
by a statistical method described in Section 8.
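The binning of the first item can be made concrete with a small lookup. This is a sketch only: the $M_{T2}$ bin edges are those appearing in Table 1 for the $M_{T2}$ analysis, while the treatment of events exactly on a boundary and the function name are assumptions made for illustration.

```python
import bisect

HT_EDGES = [750.0, 950.0]                         # low H_T: [750, 950), high H_T: >= 950 GeV
MT2_EDGES = [150.0, 200.0, 275.0, 375.0, 500.0]   # M_T2 analysis; last bin is open-ended


def signal_bin(h_t, m_t2, mt2_edges=MT2_EDGES):
    """Return (ht_region, mt2_bin) indices, or None outside the signal regions."""
    if h_t < HT_EDGES[0] or m_t2 < mt2_edges[0]:
        return None
    ht_region = 0 if h_t < HT_EDGES[1] else 1
    return ht_region, bisect.bisect_right(mt2_edges, m_t2) - 1
```

For instance, an event with $H_T = 800$ GeV and $M_{T2} = 160$ GeV falls in bin `(0, 0)`, and one with $H_T = 1000$ GeV and $M_{T2} = 600$ GeV in bin `(1, 4)`.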

6 $M_{T2}$ analysis

Figure 1 shows the measured $M_{T2}$ distribution in comparison to simulation. For
$M_{T2} < 80$ GeV the distribution is completely dominated by QCD multijet events. For medium
$M_{T2}$ values, the distribution is dominated by W+jets and Z($\nu\bar{\nu}$)+jets events with
some contribution from top-quark events, while in the tail of $M_{T2}$ the contribution from
top-quark production becomes negligible and Z($\nu\bar{\nu}$)+jets together with W+jets events
dominate. We observe good agreement between data and simulation in the core as well as in the tail
of the distribution. The white histogram (black dotted line) corresponds to the LM6 signal. It can
be noted that in the presence of signal, an excess in the tail of $M_{T2}$ is expected.

The corresponding event yields for data and SM simulated samples, after the full selection
and for the various bins in $M_{T2}$, are given in Table 1 for the low and the high $H_T$ regions.
[Figure 1 plot: $M_{T2}$ analysis, CMS, $\sqrt{s} = 7$ TeV, L = 4.73 fb$^{-1}$; horizontal axis: $M_{T2}$ [GeV]]

Figure 1: The $M_{T2}$ distribution with all selection requirements applied and $H_T > 750$ GeV. The
different predictions for the SM backgrounds from simulation are stacked on top of each other.
The LM6 signal distribution is not stacked. All distributions from simulation are normalized
to the integrated luminosity of the data.



Table 1: Observed number of events and expected SM background yields from simulation in
$M_{T2}$ bins for the low and high $H_T$ regions. These numbers are for guidance only and are not
used in the final background prediction.

$M_{T2}$ bin [GeV]     QCD multijet   W+jets     Top        Z($\nu\bar{\nu}$)+jets   Total SM   Data

$750 < H_T < 950$ GeV
[0, $\infty$]          3.18e+05       9.22e+02   1.30e+03   3.01e+02                 3.20e+05   3.20e+05
[150, 200]             3.08           37.5       20.6       27.9                     90.0       88
[200, 275]             0.0            20.6       9.40       20.3                     50.3       69
[275, 375]             0.0            9.74       2.74       11.6                     24.1       19
[375, 500]             0.0            3.63       0.69       6.07                     10.4       8
[500, $\infty$]        0.0            1.54       0.20       3.55                     5.29       6

$H_T > 950$ GeV
[0, $\infty$]          1.22e+05       4.39e+02   6.32e+02   1.42e+02                 1.23e+05   1.19e+05
[150, 200]             9.84           19.8       11.7       12.9                     54.2       70
[200, 275]             0.47           13.7       5.25       10.5                     30.0       23
[275, 375]             0.04           6.43       1.83       6.42                     14.7       9
[375, 500]             0.0            1.63       0.40       2.54                     4.57       8
[500, $\infty$]        0.0            1.10       0.16       2.16                     3.42
Contributions from other backgrounds, such as $\gamma$+jets, Z($\ell\ell$)+jets, and diboson
production, are found to be negligible. For all but one $M_{T2}$ bin, the observed number of events
agrees within the uncertainties with the SM background expectation from simulation. In the
low $H_T$ region, the $M_{T2}$ bin [200, 275] GeV exhibits an excess in data compared to background.
We investigated whether the origin could be instrumental in nature, but did not find evidence
for it. It could be of statistical origin. The excess has a marginal impact on the final observed
limit.

6.1 Background prediction 
6.1.1 QCD multijet background 

The simulation predicts that the QCD multijet background is negligible in the tail of the $M_{T2}$
distribution. Nevertheless, a dedicated method using a data control region was designed to
verify that this is indeed the case.

We base this estimation on $M_{T2}$ and $\Delta\phi_{min}$, which is the difference in azimuth
between $E_T^{miss}$ and the closest jet. The background in the signal region, defined by
$\Delta\phi_{min} > 0.3$ and large $M_{T2}$, is predicted from a control region with
$\Delta\phi_{min} < 0.2$. The two variables are strongly correlated, but a factorization method can
still be applied if the functional form is known for the ratio of the number of events
$r(M_{T2}) = N(\Delta\phi_{min} > 0.3)/N(\Delta\phi_{min} < 0.2)$ as a function of $M_{T2}$. It is
found from simulation studies, and confirmed with data, that for $M_{T2} > 50$ GeV the ratio falls
exponentially. Therefore, a parameterization of the form

$$ r(M_{T2}) = \frac{N(\Delta\phi_{min} > 0.3)}{N(\Delta\phi_{min} < 0.2)} = \exp(a - b\, M_{T2}) + c \tag{5} $$

is used for $M_{T2} > 50$ GeV. The function is assumed to reach a constant value at large $M_{T2}$
due to extreme tails of the jet energy resolution response.
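The use of Eq. (5) as a transfer factor can be sketched as follows. The parameter values are not given in the text, so any numbers passed to these functions are placeholders; the function names are hypothetical.

```python
import math


def qcd_ratio(m_t2, a, b, c):
    """r(M_T2) = exp(a - b * M_T2) + c, intended for M_T2 > 50 GeV."""
    return math.exp(a - b * m_t2) + c


def predict_qcd(n_control, m_t2, a, b, c):
    """QCD yield at dphi_min > 0.3 predicted from the dphi_min < 0.2 control region.

    n_control : event count in the control region for the given M_T2 bin
    m_t2      : representative M_T2 value of that bin (GeV)
    """
    return n_control * qcd_ratio(m_t2, a, b, c)
```

In a binned application one would evaluate the ratio at a representative $M_{T2}$ value, or integrate it over each bin; which of the two is done in the analysis is not stated here.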

The method is validated with simulation. First the parameters a, b, and c are extracted from
a fit to simulated QCD multijet events in the full $M_{T2}$ spectrum. The fitted parameter value
for c is compatible with a negligible QCD multijet contribution at large $M_{T2}$. It is verified
that similar fit results for the parameters a and b are obtained when the fit is limited to the
region $50 < M_{T2} < 80$ GeV, where contributions from background processes other than QCD
multijets are small. The robustness of the prediction is checked by systematically varying
the fit boundaries.

For the final results, we repeat the fit to data in the region $50 < M_{T2} < 80$ GeV, after
subtracting the W+jets, Z+jets, and top background contributions using simulation. The fitted
parameter values for a and b are in agreement with the values obtained from the QCD multijet
simulation. We conservatively fix the constant c to the value of the exponential at
$M_{T2} = 250$ GeV, where agreement with data can still be verified. In the lower $M_{T2}$ bins,
where the exponential term dominates, the method reliably predicts the QCD multijet background. For
higher $M_{T2}$ bins, where the constant term dominates, the method overestimates the number of QCD
multijet events relative to the simulation, nonetheless confirming that the QCD multijet
contribution is negligible.

The extreme case of total loss of a jet, leading to population of the high $M_{T2}$ tail, is studied
using a sample of high-$p_T$ mono-jet events obtained with a dedicated event selection. The total
number of events is found to be compatible within the uncertainties with the number expected
from the electroweak processes, confirming that the QCD multijet contribution is negligible
and hence that the constant c is small.
6.1.2 W($\ell\nu$)+jets and top-quark background

The backgrounds due to W($\ell\nu$)+jets and to semi-leptonic decays of top quarks have the
following sources in common:

• leptonic decays of the W boson, where the lepton is unobserved because it falls
outside the $p_T$ or $\eta$ acceptance;

• to a lesser extent, leptonic decays of the W boson, where the lepton is within the
acceptance but fails to satisfy the reconstruction, identification, or isolation criteria;

• W($\tau\nu_\tau$) decays, where the $\tau$ decays hadronically.

We refer to leptons that fall into either of the first two categories as "lost leptons". The number
of events with lost leptons is estimated from a data control sample where a single lepton (e
or $\mu$) is found. A correction factor accounting for the probability to lose the lepton is derived
from simulation. To avoid a potential contamination from signal events, a transverse mass
cut $M_T < 100$ GeV is introduced. This method is applied in the various $H_T$ and $M_{T2}$ bins.
First, a successful validation test of the method is performed using simulated samples. Then, a
prediction is made from the data bin by bin and found to be in agreement with the expectation
from simulation. A systematic uncertainty is evaluated that includes the uncertainty on the
lepton efficiencies, acceptance, and background subtraction.
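The text does not spell out the form of the correction factor. One common way to write such an estimate, given here purely as an assumed illustration, scales the background-subtracted single-lepton yield by $(1 - \varepsilon)/\varepsilon$, where $\varepsilon$ is the simulation-derived probability that a lepton from W decay is accepted, reconstructed, identified, and isolated. Both the formula and the function name are assumptions, not the documented method of the analysis.

```python
def lost_lepton_estimate(n_one_lepton, n_background, efficiency):
    """Assumed lost-lepton estimate: (N_1l - N_bkg) * (1 - eff) / eff.

    n_one_lepton : observed events with exactly one identified lepton
    n_background : expected non-W/top contribution to that sample
    efficiency   : probability (from simulation) that the lepton is found
    """
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in (0, 1]")
    return (n_one_lepton - n_background) * (1.0 - efficiency) / efficiency
```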

For the background contribution from hadronically decaying tau leptons, a method similar to
the one described above is used. Events with an isolated and identified hadronically decaying
tau lepton [39] are selected in the various $H_T$ and $M_{T2}$ bins. The contribution from jets
misidentified as taus is subtracted. The remaining number of tau events is corrected for the tau
reconstruction and identification efficiency. The predicted number of hadronically decaying
tau background events agrees with the true number from simulation. Given the small number
of events in the data, the numbers of events from the simulation are used for the background
estimate, with the same relative systematic uncertainties as for the lost leptons.

6.1.3 Z($\nu\bar{\nu}$)+jets background

The estimate of the Z($\nu\bar{\nu}$)+jets background is obtained independently from two distinct
data samples, one containing $\gamma$+jets events and the other W($\mu\nu$)+jets events. In both
cases the invisible decay of the Z boson is mimicked by removing, respectively, the photon and the
muon from the event, and adding vectorially the corresponding $p_T$ to $E_T^{miss}$.
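The emulation step amounts to a vector addition in the transverse plane. A minimal sketch, with a hypothetical function name, is:

```python
import math


def emulate_invisible_z(met_px, met_py, boson_pt, boson_phi):
    """Add the removed photon's (or muon's) pT vector to the missing pT.

    Returns the modified (px, py) of the missing transverse momentum.
    """
    return (met_px + boson_pt * math.cos(boson_phi),
            met_py + boson_pt * math.sin(boson_phi))
```

The modified missing momentum, together with the remaining jets, would then be used to recompute $M_{T2}$ for the event.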

For the estimate based on $\gamma$+jets events, a sample of events with identified and isolated
photons [40] with $p_T > 20$ GeV is selected, where all selection requirements except that on
$M_{T2}$ are imposed. This sample contains both prompt photons and photons from $\pi^0$ decays in
QCD multijet events. The two components are separated by performing a maximum likelihood fit
of templates from simulated events to the shower shapes. The event sample is dominated by
low-$p_T$ photons, where the shower shape provides high discrimination power between prompt
photons and $\pi^0$s. The extrapolation of their contributions as a function of $M_{T2}$ is obtained
from simulation. The Z($\nu\bar{\nu}$)+jets background is estimated for each bin in $M_{T2}$ from
the number of prompt photon events multiplied by the $M_{T2}$-dependent ratio of
Z($\nu\bar{\nu}$)+jets to $\gamma$+jets events obtained from simulation. This ratio increases as a
function of the photon $p_T$ (which drives the $M_{T2}$ value) and reaches a constant value above
300 GeV. The resulting prediction of the background is found to be in good agreement with the
expectation from simulation. Systematic uncertainties on the background prediction consist of the
statistical uncertainties from the number of $\gamma$+jets events, a normalization uncertainty in
the shower shape fit of 5%, and the systematic uncertainties on the ratio of Z($\nu\bar{\nu}$)+jets
to $\gamma$+jets events in the simulation. The uncertainties on the ratio are estimated to be less
than 20% (30%) for $M_{T2} < 275$ ($M_{T2} > 275$) GeV. To assess these uncertainties, the $p_T$
dependence of the ratio is studied in data and compared to simulation using leptonically decaying Z
events. For $p_T > 400$ GeV this test is limited by the number of the leptonic Z events, which
justifies the increased uncertainty for $M_{T2} > 275$ GeV.

For the estimate from W($\mu\nu$)+jets events, corrections are needed for lepton acceptance,
lepton reconstruction efficiency, and the ratio between the production cross sections for W and Z
bosons (including differences between the shapes of the distributions on which selection
criteria are applied). The lepton efficiencies are taken from studies of Z($\mu\mu$) events in
data. Also, the top-quark background to the W+jets sample is subtracted. The top-quark background is
evaluated by applying b tagging to the data to identify top-quark decays and then correcting
for the b-tagging efficiency. The Z($\nu\bar{\nu}$)+jets background is then estimated in each of
the $M_{T2}$ bins. The systematic uncertainty includes the contributions from the lepton selection
and reconstruction efficiencies, the b-tagging efficiency, the acceptance from simulation, and the
W-to-Z ratio.

The Z($\nu\bar{\nu}$)+jets background estimates from the $\gamma$+jets and W($\mu\nu$)+jets methods
are in good agreement with each other. Since they are statistically uncorrelated, we take the
weighted average of the two predictions as the final estimate.
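The text states only that a weighted average is taken. The sketch below assumes the usual inverse-variance weighting for two uncorrelated estimates; this choice of weights, and the function name, are assumptions made for illustration.

```python
def weighted_average(x1, sigma1, x2, sigma2):
    """Inverse-variance weighted mean of two uncorrelated estimates.

    Returns (mean, uncertainty). The weighting scheme is an assumption.
    """
    w1, w2 = 1.0 / sigma1**2, 1.0 / sigma2**2
    mean = (w1 * x1 + w2 * x2) / (w1 + w2)
    sigma = (w1 + w2) ** -0.5
    return mean, sigma
```

With equal uncertainties the result is the simple mean, and the combined uncertainty shrinks by a factor $\sqrt{2}$.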

6.2 Results 

The results of the background estimation methods for each background contribution are
summarized in Table 2 and shown in Fig. 2.

Table 2: Estimated event yields for each background contribution in the various $M_{T2}$ and $H_T$
bins. The predictions from control regions in data are compared to the expected event yields
from simulation. Statistical and systematic uncertainties are added in quadrature. The total
background prediction is compared to data in the last two columns.

                     Z $\to \nu\bar{\nu}$       Lost lepton          $\tau \to$ had   QCD multijet          Total bkg.     Data
$M_{T2}$ bin [GeV]   sim.    data pred.         sim.    data pred.   estimate         sim.    data pred.    data pred.

$750 < H_T < 950$ GeV
[150, 200]           27.9    24.2 ± 4.9         36.0    29.6 ± 7.1   22.5 ± 5.4       3.1     7.0 ± 3.5     83.3 ± 10.7    88
[200, 275]           20.3    21.8 ± 4.8         17.2    11.9 ± 3.9   12.7 ± 4.2       0.0     1.0 ± 0.5     47.4 ± 7.5     69
[275, 375]           11.6    13.7 ± 3.8         7.1     4.2 ± 1.9    5.4 ± 2.5        0.0     0.14 ± 0.07   23.4 ± 4.9     19
[375, 500]           6.1     4.1 ± 1.6          2.2     1.1 ± 0.9    2.2 ± 1.8        0.0     0.08 ± 0.05   7.4 ± 2.6      8
[500, $\infty$]      3.5     1.8 ± 0.9          1.1     1.2 ± 1.0    0.6 ± 0.5        0.0     0.00 ± 0.00   3.6 ± 1.4      6

$H_T > 950$ GeV
[150, 200]           12.9    16.7 ± 3.6         18.7    16.2 ± 5.3   12.7 ± 4.1       9.8     11.0 ± 5.5    56.6 ± 9.4     70
[200, 275]           10.5    4.5 ± 2.0          11.7    10.2 ± 3.7   7.1 ± 2.6        0.47    1.4 ± 0.7     23.2 ± 5.0     23
[275, 375]           6.4     5.7 ± 2.2          5.0     2.9 ± 1.7    3.3 ± 1.9        0.04    0.13 ± 0.07   12.1 ± 3.3     9
[375, 500]           2.5     3.0 ± 1.4          1.1     0.6 ± 0.6    0.9 ± 0.9        0.0     0.06 ± 0.04   4.6 ± 1.8      8
[500, $\infty$]      2.2     2.5 ± 1.5          0.6     0.6 ± 0.6    0.6 ± 0.6        0.0     0.06 ± 0.04   3.8 ± 1.7      4


7 M_T2b analysis 

Figure 2: M_T2 distribution from the background estimates compared to data. The figure on 
the left corresponds to the 750 < H_T < 950 GeV region, while that on the right corresponds to 
H_T > 950 GeV. The predictions from simulated events for the LM6 signal model (not stacked) 
are also shown. The hatched band shows the total uncertainty on the SM background estimate. 

The selection criteria developed for the M_T2 analysis are not optimal for events with heavy 
squarks and light gluinos, such as those predicted by the SUSY benchmark model LM9. To 
improve sensitivity to these types of events, we perform the M_T2b analysis, based on loosened 
kinematic selection criteria and the requirement of a tagged b jet. The restriction on M_T2 is 
loosened to M_T2 > 125 GeV, and the Δφ_min(jets, E_T^miss) > 0.3 requirement is applied to the four 
leading jets only. We require that there be at least four jets with p_T > 40 GeV and that the leading 
jet have p_T > 150 GeV. We further require that at least one of the jets in the event be tagged 
as a b-quark jet. 
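
The selection requirements listed above can be sketched as a simple event filter. The `Jet` container, the function names, and the event representation are hypothetical and introduced only for illustration. The sketch also assumes that the b tag is looked for among jets with p_T > 40 GeV, which the text does not state explicitly.

```python
import math
from dataclasses import dataclass

@dataclass
class Jet:
    pt: float    # transverse momentum in GeV
    phi: float   # azimuthal angle in radians
    btag: bool   # True if the jet is b-tagged

def delta_phi(a, b):
    """Absolute azimuthal separation, wrapped into [0, pi]."""
    d = abs(a - b) % (2.0 * math.pi)
    return 2.0 * math.pi - d if d > math.pi else d

def passes_mt2b_selection(jets, met_phi, mt2):
    """Apply the loosened M_T2b requirements to one event."""
    ordered = sorted(jets, key=lambda j: j.pt, reverse=True)
    selected = [j for j in ordered if j.pt > 40.0]
    if len(selected) < 4:                    # at least four jets with pT > 40 GeV
        return False
    if selected[0].pt <= 150.0:              # leading jet pT > 150 GeV
        return False
    if not any(j.btag for j in selected):    # at least one b-tagged jet (assumed pT > 40 GeV)
        return False
    if mt2 <= 125.0:                         # loosened M_T2 requirement
        return False
    # Minimum delta-phi between the missing transverse energy and the four leading jets.
    dphi_min = min(delta_phi(j.phi, met_phi) for j in selected[:4])
    return dphi_min > 0.3
```

An H_T requirement and the lepton vetoes are not included here, since they are not part of the paragraph above.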

Figure 3 shows the M_T2 distribution for events that satisfy the M_T2b selection criteria and with 
H_T > 750 GeV. As for the M_T2 analysis (Fig. 1), the QCD multijet background dominates 
for M_T2 < 80 GeV but is strongly suppressed for M_T2 > 125 GeV. In the signal region, 
top-quark events dominate the electroweak contribution. The white histogram (black dotted line) 
corresponds to the LM9 signal. The corresponding event yields for data and SM simulation for 
the low and high H_T regions are summarized in Table 3. 



Table 3: Observed number of events and expected SM background event yields from simulation 
in the various M_T2 bins for the M_T2b event selection. These numbers are for guidance only and 
are not used in the final background prediction. 





                   QCD multijet   W+jets      Top         Z(νν̄)+jets   Total SM    Data

750 < H_T < 950
M_T2 [0,∞]         2.83e+04       4.53e+02    1.15e+03    1.41e+02     2.97e+04    2.99e+04
M_T2 [125,150]     5.16           1.86        20.3        0.95         28.3        22
M_T2 [150,200]     0.16           1.94        17.9        2.00         22.1        16
M_T2 [200,300]     0.0            1.84        9.43        1.25         12.6        16
M_T2 [300,∞]       0.0            0.57        2.55        0.53         3.65        2

H_T > 950
M_T2 [0,∞]         1.19e+04       2.18e+01    5.46e+02    6.51e+00     1.25e+04    1.23e+04
M_T2 [125,150]     1.25           0.76        9.95        0.64         12.7        10
M_T2 [150,180]     0.57           0.79        7.15        0.43         8.96        10
M_T2 [180,260]     0.67           1.09        6.62        0.68         9.06        9
M_T2 [260,∞]       0.04           0.76        3.09        0.65         4.55        3



Figure 3: M_T2 for events with the M_T2b selection criteria applied and with H_T > 750 GeV. The 
different predictions from simulation for the SM backgrounds are stacked on top of each other. 
The LM9 signal distribution is not stacked. All distributions from simulation are normalized 
to the integrated luminosity of the data. 



7.1 Background prediction and results 

The QCD multijet contribution is estimated following the same approach as for the M_T2 
analysis. We find that the function in Eq. (5), fitted to data in the region 50 < M_T2 < 80 GeV, provides 
a good description of the QCD multijet background, also for events containing b-tagged jets. 
From the fit to data, the prediction of the QCD multijet background is obtained in the various 
M_T2 bins for the low and high H_T regions. 

Events arising from top-quark production are the dominant background contribution in the 
signal region. The top-quark contribution is evaluated, together with the one from W(ℓν)+jets, 
in the same way as for the M_T2 analysis, using single-electron and single-muon events, as well 
as taus decaying to hadrons. 

The background from Z(νν̄)+jets events is expected to be very small compared with the 
background from top-quark events. We estimate the background from Z(νν̄)+jets events with the 
method based on W+jets events discussed for the M_T2 analysis. As the selection of W(μν)+jets 
events includes a b-tag veto to suppress the top-quark background, a ratio of efficiencies for 
W(μν)+jets events with a b tag to W(μν)+jets events without a b tag is taken into account. This 
ratio is obtained from simulation. 

The results of the estimates for the various backgrounds are summarized in Table 4 and shown 
in Fig. 4. 



Table 4: Estimated event yields for each background contribution in the various M_T2 and H_T 
bins. The predictions from control regions in data are compared to the expected event yields 
from simulation. Statistical and systematic uncertainties are added in quadrature. The total 
background prediction is compared to data in the last two columns. 
[Table 4 body incomplete in source. Surviving column labels: data pred., data pred., Estimate, data pred. Surviving entries: 5.16, 22, 16, 1.3, 0.0, 0.04 ± 0.03] 
Figure 4: M_T2 distribution from the background estimates compared to data for the M_T2b 
selection. The figure on the left corresponds to the 750 < H_T < 950 GeV region, while that on 
the right corresponds to H_T > 950 GeV. The predictions from simulation for the LM9 signal 
model (not stacked) are also shown. The hatched band shows the total uncertainty on the SM 
background estimate. 



8 Statistical Interpretation of the results and exclusion limits 

No significant deviation from the SM background prediction is observed, and upper limits are 
set on a potential signal. The statistical approach used to derive limits follows closely the 
methodology of Ref. [41]. A brief description of the steps relevant to this analysis follows. 

First, a likelihood function is constructed as the product of Poisson probabilities for each (H_T, 
M_T2) search bin. These probabilities are functions of the predicted signal and background yields 
in each bin. Systematic uncertainties are introduced as nuisance parameters in the signal and 
background models. Log-normal distributions are taken as a suitable choice for the probability 
density distributions for the nuisance parameters. 
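
A minimal sketch of such a likelihood is given below. It assumes one nuisance parameter per bin acting on the background only, with the log-normal constraint written as a unit-Gaussian parameter theta that scales the background by kappa**theta. The function names, the parametrization, and the restriction to background nuisances are assumptions made for illustration, not the configuration used in the analysis.

```python
import math

def poisson_log_pmf(n, mu):
    """Log of the Poisson probability of observing n events for mean mu > 0."""
    return n * math.log(mu) - mu - math.lgamma(n + 1)

def log_likelihood(observed, signal, background, lam, theta, kappa):
    """Log-likelihood summed over search bins.

    observed, signal, background: per-bin yields.
    lam: signal strength modifier.
    theta: per-bin nuisance parameters (unit Gaussian, assumption).
    kappa: per-bin log-normal width, e.g. 1.2 for a 20% uncertainty.
    """
    total = 0.0
    for n, s, b, t, k in zip(observed, signal, background, theta, kappa):
        mu = lam * s + b * k ** t          # log-normal variation of the background
        total += poisson_log_pmf(n, mu)    # Poisson term for this bin
        total += -0.5 * t * t              # constraint term, constant dropped
    return total
```

Working with the logarithm turns the product over bins into a sum, which is numerically more stable when many bins are combined.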



In order to compare the compatibility of the data with the background-only and the 
signal-plus-background hypotheses, we construct the test statistic q_λ based on the profile likelihood 
ratio: 

    q_\lambda = -2 \ln \frac{\mathcal{L}(\mathrm{data} \mid \lambda, \hat{\theta}_\lambda)}{\mathcal{L}(\mathrm{data} \mid \hat{\lambda}, \hat{\theta})}, \quad \text{with } 0 \le \hat{\lambda} \le \lambda,    (6)

where the signal strength modifier λ is introduced to test signal cross section values σ = λσ_sig. 
Both the denominator and the numerator are maximized. In the numerator, the signal strength 
parameter λ remains fixed and the likelihood is maximized only for the nuisance parameters, 
whose values at the maximum are denoted θ̂_λ. In the denominator, the likelihood is 
maximized for both λ and θ; λ̂ and θ̂ denote the values at which L reaches its global maximum in 
the denominator. The lower constraint 0 ≤ λ̂ is imposed because the signal strength cannot 
be negative, while the upper constraint λ̂ ≤ λ guarantees a one-sided confidence interval. The 
value of the test statistic for the actual observation is denoted q_λ^obs. This test statistic [41] differs 
from that used at LEP and the Tevatron. 
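
To make the role of the two constraints concrete, the sketch below evaluates the test statistic for a single-bin counting experiment without nuisance parameters. This is a deliberate simplification: in that case the constrained maximum of the likelihood is simply the unconstrained estimate (n − b)/s clipped to the interval [0, λ]. The function names are illustrative.

```python
import math

def log_poisson(n, mu):
    """Log of the Poisson probability of n events for mean mu > 0."""
    return n * math.log(mu) - mu - math.lgamma(n + 1)

def q_lambda(n, s, b, lam):
    """Profile-likelihood-ratio test statistic for one bin, no nuisances.

    n: observed count, s: nominal signal yield (> 0),
    b: background yield (> 0), lam: tested signal strength.
    """
    # Best-fit signal strength, restricted to 0 <= lam_hat <= lam.
    lam_hat = min(max((n - b) / s, 0.0), lam)
    return -2.0 * (log_poisson(n, lam * s + b) - log_poisson(n, lam_hat * s + b))
```

When the data fluctuate above the tested signal-plus-background expectation, the upper constraint forces the best-fit strength to equal λ and the statistic is zero, so upward fluctuations never count as evidence against the signal hypothesis.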

To set limits, a modified frequentist CL_s approach is employed [42, 43]. We first define the 
probabilities to obtain an outcome of an experiment at least as signal-like as the one observed 
for the background-only and for the signal-plus-background hypotheses. The CL_s quantity is 
then defined as the ratio of these two probabilities. In the modified frequentist approach, the 
value of CL_s is required to be less than or equal to α in order to establish a (1 − α) confidence 
level (CL) exclusion. To quote the upper limit on λ for a given signal at 95% CL, we adjust λ 
until we reach CL_s = 0.05. 
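
The limit-setting step can be illustrated with a single-bin counting experiment. As a simplifying assumption, the sketch below uses the observed event count itself as the test statistic instead of the profile likelihood ratio, ignores systematic uncertainties, and computes the two probabilities exactly from Poisson sums rather than from pseudo-experiments. All names and inputs are hypothetical.

```python
import math

def poisson_cdf(n_obs, mu):
    """P(n <= n_obs) for a Poisson distribution with mean mu."""
    term = math.exp(-mu)
    total = term
    for k in range(1, n_obs + 1):
        term *= mu / k
        total += term
    return total

def cls(n_obs, s, b, lam=1.0):
    """CL_s = CL_{s+b} / CL_b for a counting experiment."""
    cl_sb = poisson_cdf(n_obs, lam * s + b)   # signal-plus-background hypothesis
    cl_b = poisson_cdf(n_obs, b)              # background-only hypothesis
    return cl_sb / cl_b

def upper_limit(n_obs, s, b, alpha=0.05, lo=0.0, hi=100.0, tol=1e-6):
    """Bisect on the signal strength until CL_s = alpha.

    CL_s decreases monotonically with lam, so bisection is sufficient
    as long as the bracket [lo, hi] contains the solution.
    """
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if cls(n_obs, s, b, mid) > alpha:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical example: zero observed events, unit signal yield.
limit = upper_limit(0, 1.0, 3.0)
```

For zero observed events CL_s reduces to exp(−λs), independent of the background, so the 95% CL limit corresponds to ln 20 signal events.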



8.1 Exclusion limits in the CMSSM plane 

Exclusion limits at 95% CL are determined in the CMSSM (m_0, m_1/2) plane [44]. The 
signal cross section is calculated at NLO and next-to-leading-log (NLL) accuracy [26, 45, 46]. At 
each point in the scan, four CL_s values are computed for λ = 1: the observed, the median 
expected, and the one standard deviation (±1σ) expected bands. If the corresponding CL_s value 
is smaller than 0.05, the point is excluded at 95% CL, resulting in the exclusion limits shown 
in Fig. 5. The results from both the M_T2 and M_T2b selections are shown in Fig. 5 (top). In 
Fig. 5 (bottom), the results are combined into a single exclusion limit curve based on the best 
expected limit at each point of the plane. 
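
The combination rule can be sketched as follows. The sketch assumes that "best expected limit" means the smallest median expected CL_s value at λ = 1, and that the observed CL_s of the analysis chosen this way decides whether the point is excluded. The function name and the input numbers are hypothetical.

```python
def combine_best_expected(results, threshold=0.05):
    """Pick the analysis with the best expected sensitivity at one scan point.

    results: mapping from analysis name to (expected_cls, observed_cls).
    Returns the chosen analysis and whether the point is excluded.
    """
    # The choice uses only the expected value, never the observed one,
    # so the selection is not biased by fluctuations in the data.
    best = min(results, key=lambda name: results[name][0])
    _, observed = results[best]
    return best, observed < threshold

# Hypothetical CL_s values at a single point of the plane.
best, excluded = combine_best_expected({"MT2": (0.20, 0.03), "MT2b": (0.04, 0.08)})
```

In this example the point is not excluded, even though one analysis has an observed CL_s below 0.05, because the analysis with the better expected sensitivity does not exclude it.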

The dominant sources of systematic uncertainties on the signal model are found to be the jet 
energy scale and (for the M_T2b analysis) the b-tagging efficiency. These two uncertainties are 
evaluated at each point of the CMSSM plane, typically ranging from 5 to 25% for the former 
and from 2 to 6% for the latter. Additionally, a 2.2% uncertainty is associated with the 
luminosity determination [47]. All these uncertainties are included in the statistical interpretation as 
nuisance parameters on the signal model. 

Observed exclusion limits are also determined when the signal cross section is varied by 
changing the renormalization and factorization scales by a factor of 2 and using the PDF4LHC 
recommendation [48] for the PDF uncertainty. The exclusion contours obtained from this method 
are shown by the dashed curves of Fig. 5 and referred to as theory uncertainties. 

The effect of signal contamination in the leptonic control region could be significant, yielding a 
potential background overprediction of about 1-15%. To account for this effect, the signal yields 
are corrected by subtracting the expected increase in the background estimate that would occur 
if the given signal were present in the data. 
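
The correction described above amounts to a subtraction per search bin. In the sketch below, the expected increase of the background estimate is modeled as the signal yield in the leptonic control region multiplied by a transfer factor from the control region to the signal region. The transfer-factor model, the clipping at zero, and all names are assumptions made for illustration.

```python
def corrected_signal_yield(signal_in_sr, signal_in_cr, transfer_factor):
    """Signal yield in a search bin after the contamination correction.

    signal_in_sr: expected signal events in the signal region.
    signal_in_cr: expected signal events in the leptonic control region.
    transfer_factor: assumed factor translating control-region events
        into a background prediction in the signal region.
    """
    # Extra background that would be predicted if the signal were present.
    increase = signal_in_cr * transfer_factor
    # Clip at zero so the corrected yield stays physical (assumption).
    return max(signal_in_sr - increase, 0.0)

# Hypothetical yields, not taken from the analysis.
corrected = corrected_signal_yield(10.0, 2.0, 0.5)
```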

The results in Fig. 5 (top) establish that the M_T2 analysis is powerful in the region of large 
squark and gluino masses, corresponding to small m_0 and large m_1/2, while the M_T2b analysis 
increases sensitivity to large squark and small gluino masses, corresponding to large m_0 and 
small m_1/2. Conservatively, using the minus one standard deviation (−1σ) theory uncertainty 
values of the observed limit, we derive absolute lower limits on the squark and gluino masses 
for the chosen CMSSM parameter set. We find lower limits of m(q̃) > 1110 GeV and m(g̃) > 
800 GeV, as well as m(q̃) = m(g̃) > 1180 GeV assuming equal squark and gluino masses. 

Figure 5: Top: exclusion limit in the CMSSM (m_0, m_1/2) plane for the M_T2 and M_T2b analyses 
with tan β = 10. Bottom: combined limit based on the best expected limit at each point. 



8.2 Exclusion limits for simplified model spectra 

In this section we interpret the results in terms of simplified model spectra [49], which allow a 
presentation of the exclusion potential in the context of a larger variety of fundamental models, 
not necessarily in a supersymmetric framework. We studied the following topologies: 

• gluino pair production, with g̃ → qq̄ χ̃^0; 

• gluino pair production, with g̃ → bb̄ χ̃^0; 

• gluino pair production, with g̃ → tt̄ χ̃^0; 

• gluino pair production, with g̃ → qq̄ Z χ̃^0. 

The last of these models is used to demonstrate the sensitivity of the analysis in a high jet 
multiplicity topology, since the hadronic decay of the Z boson can lead to (maximally) 8 jets 
in the final state. In Fig. 6 the 95% CL excluded cross sections are reported as a function of 
the relevant masses for gluino pair production with g̃ → qq̄ χ̃^0 using the M_T2 analysis, and 
for g̃ → bb̄ χ̃^0, g̃ → tt̄ χ̃^0, and g̃ → qq̄ Z χ̃^0 using the M_T2b analysis. Systematic uncertainties 
on jet energy scale and on b-tagging efficiencies are taken into account as nuisance parameters 
on the signal model. To minimize the effect of ISR modeling uncertainties, the region near 
the diagonal is excluded in the limit setting. Observed, median expected, and one standard 
deviation (±1σ experimental) expected limit curves are derived for the nominal signal cross 
section. Also shown is the ±1σ variation in the observed limit when the signal cross section 
is varied by its theoretical uncertainties. 

9 Summary 

We have conducted a search for supersymmetry or similar new physics in hadronic final states 
using the M_T2 variable calculated from massless pseudojets. M_T2 is strongly correlated with 
E_T^miss for SUSY processes, yet provides a natural suppression of QCD multijet background. The 
data set for this analysis corresponds to 4.73 fb^-1 of integrated luminosity in √s = 7 TeV pp 
collisions collected with the CMS detector during the 2011 LHC run. All candidate events are 
selected using hadronic triggers. Two complementary analyses are performed. The M_T2 
analysis targets decays of moderately heavy squarks and gluinos, which naturally feature a sizeable 
E_T^miss. This analysis is based on events containing three or more jets and no isolated leptons. We 
show that the tail of the M_T2 distribution, obtained after this selection, is sensitive to a potential 
SUSY signal. A second approach, the M_T2b analysis, is designed to increase the sensitivity to 
events with heavy squarks and light gluinos, in which the E_T^miss tends to be smaller. Therefore, 
the restriction on M_T2 is relaxed. The effect of the loosened M_T2 requirement is compensated by requiring 
at least one b-tagged jet and a larger jet multiplicity, to suppress the QCD multijet background. 
For both analyses, the standard model backgrounds, arising from QCD multijet, electroweak, 
and top-quark production processes, are obtained from data control samples and simulation. 
No excess beyond the standard model expectations is found. Exclusion limits are established 
in the CMSSM parameter space, as well as for some simplified model spectra. Conservatively, 
using the minus one standard deviation (−1σ) theory uncertainty values, absolute mass limits 
in the CMSSM scenario for tan β = 10 are found to be m(q̃) > 1110 GeV and m(g̃) > 800 GeV, 
and m(q̃) = m(g̃) > 1180 GeV assuming equal squark and gluino masses. 
Figure 6: Exclusion limits for simplified model spectra. Upper left: gluino pair production with 
g̃ → qq̄ χ̃^0, using the M_T2 analysis. Upper right: gluino pair production with g̃ → bb̄ χ̃^0, using 
the M_T2b analysis. Lower left: gluino pair production with g̃ → tt̄ χ̃^0, using the M_T2b analysis. 
Lower right: gluino pair production with g̃ → qq̄ Z χ̃^0, using the M_T2b analysis. The signal 
production cross sections are calculated at NLO and NLL accuracy 